Back

Molecular & Cellular Proteomics

Elsevier BV

Preprints posted in the last 90 days, ranked by how well they match Molecular & Cellular Proteomics's content profile, based on 158 papers previously published here. The average preprint has a 0.11% match score for this journal, so anything above that is already an above-average fit.

1
Unbiased proteomics following inflammasome activation identifies caspase targets in primary intestinal epithelial cells

Gibson, A. R.; Diaz Ludovico, I.; Clair, G. C.; Hutchinson-Bunch, C. M.; Adkins, J. N.; Rauch, I.

2026-04-22 immunology 10.64898/2026.04.20.719683 medRxiv
Top 0.1%
42.1%
Show abstract

Inflammasomes are cytosolic innate immune sensors that, once activated by a pathogenic threat, lead to activation of the inflammatory Caspase-1. Inflammasome activation and its consequences have been studied extensively in myeloid cells and in overexpression systems. Recent studies have identified cell type specific effects that are not fully explained by the known cleavage targets of Caspase-1. Here, we identified targets of caspase cleavage using mass spectrometry in primary intestinal epithelial cells by specifically activating the NAIP-NLRC4 inflammasome. We have taken an unbiased approach and developed a novel method for analyzing mass spectrometry data for evidence of caspase activity. Our approach can also be applied to existing proteomic datasets to establish the presence of caspase activity under various biological conditions. These results lay the groundwork for future studies on mechanisms of caspase-induced processes such as intestinal epithelial cell extrusion.

2
From variability to consensus: rescoring harmonizes peptide identification across diverse search engines and datasets

Winkelhardt, D.; Berres, S.; Uszkoreit, J.

2026-03-06 bioinformatics 10.64898/2026.03.04.709532 medRxiv
Top 0.1%
29.3%
Show abstract

Peptide-spectrum match (PSM) rescoring has become standard in proteomics workflows, improving peptide identification accuracy across diverse search engines. Despite the availability of multiple rescoring strategies, systematic comparisons spanning several search engines, datasets, and database configurations remain limited. Here, we benchmarked seven publicly available search engines, evaluating standard target-decoy-based false discovery rate (FDR) estimation alongside Percolator, MS2Rescore, and Oktoberfest across four datasets acquired on different mass spectrometry platforms and searched against protein databases of varying size and composition. Rescoring substantially increased identification consensus and reduced variability between search engines, with prediction-based approaches yielding the largest gains. While database size had limited impact for human datasets, it significantly affected identification rates on a metaproteomic dataset. Entrapment-based evaluation indicated generally adequate FDR control across methods, although prediction-based rescoring exhibited a slightly higher tendency toward FDR underestimation in specific configurations. Overall, advanced rescoring strategies harmonize peptide identification outcomes across search engines, thereby enhancing robustness and comparability in proteomics analyses. However, careful feature selection and appropriate database choice remain essential to ensure reliable FDR control and optimal performance across diverse experimental settings.

3
Single-cell proteomics reveals proteome remodeling and cellular heterogeneity during NGF-induced PC12 neuronal differentiation

Ebrahimi, A.

2026-03-26 neuroscience 10.64898/2026.03.25.710659 medRxiv
Top 0.1%
26.3%
Show abstract

1Single-cell proteomics (SCP) enables direct measurement of cellular heterogeneity during dynamic biological processes. Here, we applied an SCP workflow to investigate proteome diversity during nerve growth factor (NGF)-induced differentiation of PC12 cells. Differentiated PC12 cells are highly adherent and prone to aggregation, complicating single-cell sample preparation. To address this challenge, sample handling was optimized using gentle dissociation, anti-adhesive conditions, and rapid processing immediately prior to cell isolation. Individual cells were deposited using a refined thermal inkjet (TIJ) dispensing system, enabling accurate single-cell placement with minimal sample loss. Inclusion of the mild nonionic surfactant n-dodecyl-{beta}-D-maltoside (DDM) improved recovery of membrane-associated and other low-solubility proteins. Coupled with high-sensitivity liquid chromatography-ion mobility-mass spectrometry, this workflow consistently quantified approximately 2,000-3,000 proteins per cell across differentiation stages. Single-cell proteomic profiles acquired over the differentiation time course revealed clear separation between undifferentiated and NGF-treated cells by Day 6. At later stages (Days 4-6), cells further partitioned into two distinct subpopulations with protein expression patterns not evident in bulk measurements. Dimensionality reduction and non-negative matrix factorization identified multiple proteomic states coexisting within the same differentiation stages, characterized by coordinated differences in pathways related to intracellular trafficking, protein translation, and neuronal structural organization. Together, these results show that while global proteome remodeling during PC12 differentiation is captured in both bulk and single-cell data, single-cell proteomics uniquely resolves functionally distinct cellular subpopulations that are masked in population-averaged analyses.

4
Stoichiometry-dependent specificity in biotin enrichment: a benchmarking framework for proximity labeling proteomics

Zala, C. A.; Trueba Sanchez, M. C.; van den Bor, J.; Willemsens, T.; Verweij, F. J.; Altelaar, M.; Stecker, K.

2026-05-11 molecular biology 10.64898/2026.05.07.723439 medRxiv
Top 0.1%
23.4%
Show abstract

Proximity labeling methods (including, BioID, TurboID, ultraID), along with surface proteomics and microdomain mapping, enable proteome-wide identification of spatially proximal proteins via MS-based analysis. These workflows require specific enrichment of biotinylated proteins using affinity purification, yet enrichment specificity can often be compromised by non-specifically bound proteins. As labeling strategies are increasingly applied to complex biological samples with low protein input or low biotin stoichiometry, accurately distinguishing true targets from background becomes a major analytical challenge. Despite its critical impact on data quality and interpretation, the influence of biotinylation level and protein input on enrichment performance remains poorly characterized, limiting the reliability of proximity labeling experiments. To address this, we establish a quantitative benchmarking framework that systematically evaluates biotin enrichment under controlled conditions, including scenarios of low biotin stoichiometry. Using this setup, we show that enrichment specificity strongly depends on biotin stoichiometry: higher levels of biotinylation in samples yield high specificity, whereas low biotinylation increases non-specific background. Reduced protein input further limits recovery of true targets, yet maintains enrichment specificity, highlighting sensitivity constraints of enrichment-based workflows. We apply this framework to biotinylated extracellular vesicle (EV) cargo uptake in recipient cells using ultraID-CD63 labeling. Detection of the most abundant EV cargo proteins under low biotinylation conditions indicates that current workflows approach the lower bounds of biotin enrichment sensitivity. Together, these standards provide a practical reference for evaluating and optimizing biotin enrichment workflows, supporting quantitative and reproducible proximity labeling in proteomics.

5
Detergent-Free Nuclear-Cytoplasmic Fractionation Enables Spatially Resolved PELSA for Enhanced Nuclear Drug Target Identification

Cai, D.; Zou, K.; Wang, J.; Zhu, H.; Ma, Y.; Yang, D.; Zhang, X.; Yan, J.; Zou, L.; Wang, K.; Ye, M.

2026-04-13 biochemistry 10.64898/2026.04.10.717665 medRxiv
Top 0.1%
23.3%
Show abstract

Accurate identification of drug target proteins remain major challenges in proteomics-based target discovery, particularly for low-abundance nuclear proteins that are difficult to detect because of the complexity of whole-cell lysates. Here, we developed a detergent-free nuclear-cytoplasmic fractionation strategy compatible with peptide-centric local stability analysis (PELSA), which markedly improves detection of nuclear drug targets. Using K562 cells, we demonstrated that mild detergent-free fractionation enables high-fidelity nuclear-cytoplasmic separation with minimal cross-contamination. When coupled with PELSA, this workflow significantly increases the number of detected nuclear targets relative to whole-cell analysis. Benchmarking with well-characterized nuclear drugs, including the histone deacetylase inhibitor panobinostat and the RNA polymerase II inhibitor -amanitin, our results showed improved identification of canonical nuclear targets. Broad profiling of staurosporine target further revealed expanded kinase target coverage by combining the results of nuclear and cytoplasmic fraction, with the CLK family kinases detected exclusively in the nuclear fractions. Additionally, we showed that PELSA can also be performed on intact nucleus level. Collectively, these findings establish detergent-free nuclear-cytoplasmic fractionation-PELSA as a robust and scalable strategy for spatially resolved drug target identification, improving sensitivity for nuclear and low-abundance proteins.

6
Proteome analyses reveal Endoplasmic Reticulum stress-induced changes in protein abundance associated with Ube2j2 deficiency in human cell culture

Dahlberg, C. L.; Zinkgraf, M.; Laugesen, S. H.; Soltoft, C. L.; Ginebra, Q.; Bennett, E. P.; Hartmann-Petersen, R.; Ellgaard, L.

2026-04-03 bioinformatics 10.64898/2026.03.31.715661 medRxiv
Top 0.1%
23.2%
Show abstract

The unfolded protein response (UPR) helps reinstate cellular proteostasis upon an accumulation of misfolded proteins in the endoplasmic reticulum (ER), in part through ER-associated degradation (ERAD). Ube2j2 is an ER-localized E2 ubiquitin-conjugating enzyme that participates in ERAD. We used mass spectrometry analysis of cultured U2OS cells to investigate how the loss of Ube2j2 affects the cellular proteome in response to tunicamycin-induced ER stress. We constructed a network of twelve statistically distinct modules of protein abundance profiles across conditions. We describe the Gene Ontology annotations for each module along with the "hub gene" proteins whose abundance levels most closely adhere to each modules protein abundance profile. Our analysis identifies known Ube2j2-associated pathways (e.g., the UPR and ERAD) and cellular functions that were previously unassociated with Ube2j2 (e.g., RNA metabolism, ER-Golgi transport, and cell-cycle progression). These data are available via ProteomeXchange with identifier PXD076153 and provide avenues for further investigation into the cellular functions of Ube2j2 under basal and ER-stressed conditions.

7
From Peaks to Power: Systematic Evaluation of Chromatographic Sampling Reveals Determinants of Quantification and Biological Discovery in DIA Proteomics

Cantrell, L. S.; Just, S.; Stukalov, A.; Farokhzad, O. C.; Batzoglou, S.

2026-05-16 bioinformatics 10.64898/2026.05.13.724964 medRxiv
Top 0.1%
23.1%
Show abstract

Modern DIA proteomics increasingly emphasizes throughput and depth for large-cohort studies, but methods are often optimized using proxy metrics that can mask losses in quantifiable signal and statistical power. Here, we evaluate how datapoints per peak and other chromatographic features jointly contribute to quantification and downstream biological discovery. Using a matrix-matched calibration curve dataset, we checked how the number of datapoints per peak (DPPP) affects the limits of detection and quantification (LOD/LOQ). Reduced DPPP minimally affected LOD but substantially degraded LOQ. Feature modeling and nonparametric association analyses identified precursor peak area as the strongest feature-level predictor of LOQ, whereas DPPP showed weaker and context-dependent effects. Simulations of chromatographic peak integration recapitulated these trends, showing that increased sampling primarily improves integration precision, while quantitative accuracy is strongly governed by peak height and peak shape. Finally, when comparing 20 cancer vs 20 control plasma samples processed with Seer Proteograph, the decrease in DPPP led to a loss of statistical significance for proteins with low-abundance precursors. These findings argue that DIA optimization should prioritize LOQ and statistical power metrics - not identifications alone - by balancing sampling density with chromatographic peak height and quality to maximize useful biological signal.

8
Trypsin exhibits exopeptidase-like activity toward N-terminal arginine that biases proteomic analyses

Ambrose, E. A.; Kandasamy, G.; Meulener, M. M.; Zhang, F.

2026-05-16 biochemistry 10.64898/2026.05.15.725550 medRxiv
Top 0.1%
23.1%
Show abstract

Many proteomics protocols rely on enzymatic digestion of complex protein mixtures to generate peptides with predictable cleavage patterns for the mass spectrometry analysis. One of the most utilized enzymes, trypsin, is classically defined as a serine endopeptidase with high specificity for cleaving peptide bonds on the C-terminal side of internal lysine and arginine residues. Accordingly, trypsin is not expected to remove the N-terminal arginine, which may arise through posttranslational modification such as arginylation or by proteolysis exposing internal residues as the new N-termini. N-terminal arginine plays important biological roles, including functioning as an N-degron and modulating protein interactions/signaling through its positive charge. Curiously, prior mass spectrometry-based studies utilizing trypsin to identify proteins bearing N-terminal arginine have frequently reported low and inconsistent yields, suggesting potential systematic bias in current proteomic approaches. Here, we explored whether trypsin would affect the integrity of the N-terminal arginine. By using antibodies specifically recognizing N-terminal arginine of different peptides, and by using mass spectrometry peptide analysis, we show that trypsin can remove N-terminal arginine residues in an exopeptidase-like manner. This effect occurs across a range of digestion conditions consistent with standard proteomic workflows, on peptides or whole proteins, and depends on trypsin concentration, incubation time, and catalytic activity. In addition, we show that the alternative arginine-cleavage enzyme Arg-C can also affect N-terminal arginine in a sequence-dependent context. In contrast, Lys-C and LysargiNase do not exhibit such effects, providing suitable alternative digestion strategies. Together, these findings reveal an unappreciated enzymatic behavior of arginine-cleaving proteases and suggest that their widespread use may systematically compromise the detection of N-terminal arginine in proteomic studies.

9
A Deep Quantitative Proteome Turnover Platform for Human iPSC-derived Neurons

Hao, L.; Frankenfield, A. M.; Shih, J.; Zhang, T.; Ni, J.; Mazli, W. N. A. b.; Lo, E.; Liu, Y.; Wang, J.

2026-03-16 neuroscience 10.64898/2026.03.14.711828 medRxiv
Top 0.1%
23.0%
Show abstract

Quantitative evaluation of protein turnover in human neurons is crucial for understanding neuron homeostasis and guiding drug development for neurological diseases. However, measuring protein turnover in postmitotic neurons remains challenging due to the high dynamic range of protein half-lives and limited proteome coverage in SILAC (Stable Isotope Labeling by Amino acids in Cell culture) experiments. Despite broad applications of dynamic SILAC proteomics to measure protein turnover in rodent tissues and primary neurons, few studies have measured protein half-lives in human neurons with limited proteome coverage. Here, we established a comprehensive platform to quantify protein half-lives in human induced pluripotent stem cell (iPSC)-derived neurons. By integrating optimized dynamic SILAC labeling in human neuron cultures, extensive peptide fractionation, optimized data-dependent and data-independent LC-MS/MS acquisition methods, and a streamlined computational pipeline, we achieved deep and accurate measurement of 10,792 protein half-lives from 162,854 unique peptides. We then compared the protein turnover and abundances in iPSC-derived glutamatergic cortical neurons and spinal motor neurons, revealing globally conserved proteome dynamics alongside subtype-specific differences consistent with specialized neuronal functions. To enable broad community access, we created NeuronProfile (www.neuronprofile.com), an interactive web platform for exploring protein turnover, abundance, and subcellular location in human neurons. Together, this work provides a comprehensive analytical platform to assess human neuronal proteostasis and a foundational resource for neurological disease research and therapeutic development.

10
Importance of taking Single Amino Acid Variant and accessory proteome variability into account in Data Independent Acquisition Proteomics: illustrated with Legionella pneumophila analysis

Dupas, A.; Ibranosyan, M.; Ginevra, C.; Jarraud, S.; Lemoine, J.

2026-04-03 bioinformatics 10.64898/2026.04.01.715759 medRxiv
Top 0.1%
23.0%
Show abstract

Understanding allelic variability is crucial for elucidating intrinsic bacterial mechanisms and distinguishing phenotypic profiles. However, such variability poses a major challenge for the reliable identification of proteins in data-independent acquisition (DIA) proteomics. To address this, we developed an analytical workflow that integrates protein sequence variability to enhance proteome coverage. Fifteen Legionella pneumophila isolates were analyzed using DIA-NN, with spectral libraries generated either from a reference proteome or incorporating allelic variability. Our workflow includes protein clustering and subsequent protein inference from these clusters, allowing the accurate assignment of shared and variant-specific peptides. Integration of variability enabled the identification of a comparable number of proteins as the reference proteome while capturing between 28 and 77 % of variant-specific sequences in each isolate, all while maintaining a low false positive rate. These findings demonstrate that accounting for allelic variability substantially improves proteomic coverage and identification confidence, providing a more comprehensive view of the proteome. This approach facilitates a deeper understanding of biological mechanisms and enables precise bacterial proteotyping of Legionella pneumophila isolates.

11
LAMPrEY: a Python-based automated quality control tool for large-scale proteomics datasets

Valdes-Tresanco, M. E.; Wacker, S.; Valdes-Tresanco, M. S.; Plakhotnyk, A.; Brodie, N. I.; Hepburn, M.; Ulke-Lemee, A.; Huttlin, E. L.; Lewis, I. A.

2026-05-11 bioinformatics 10.64898/2026.05.06.722826 medRxiv
Top 0.1%
22.8%
Show abstract

Over the past years, proteomics has moved increasingly towards the analysis of large cohorts of biological specimens. This has been made possible by significant improvements in mass spectrometry technology, chromatographic separation methods, and improved data acquisition strategies. These technological advances now routinely enable experiments that yield vast datasets that substantially outstrip the capacity of existing proteomics data analysis approaches. Processing such large datasets requires purpose-built, quality control tools designed to organize and analyze the data while recording all processing parameters for reproducibility. To address this need, we developed an open-source, Python-based software platform, Large-scale Automated Multi-level Proteomics Evaluation by Python (LAMPrEY), a comprehensive quality-control pipeline for quantitative proteomics analyses of large cohorts of samples. LAMPrEY features GUI-based file submission, automated processing with MaxQuant and RawTools, an interactive analytics dashboard, and an application programming interface (API) for programmatic usage that collectively enable rapid, reproducible analysis and interpretation of proteomics data. We demonstrate the longitudinal monitoring and analytical capabilities of LAMPrEY using TMT11 quantitative proteomics data generated from 910 Enterococcus faecium isolates collected from bloodstream infection patients. LAMPrEY is an open-source software that can be accessed at www.lewisresearchgroup.org/software.

12
A Multimodal Workflow for Spatial Metabolic Neighborhood Mapping in Neural Rosette Cultures

Adebayo, O. N.; Turaga, A.; Chung, M.; Fernandez, F.; Kemp, M. L.

2026-04-13 developmental biology 10.64898/2026.04.13.715964 medRxiv
Top 0.1%
22.8%
Show abstract

Neural rosettes are hallmarks of the neural progenitor cell stage that is a necessary pre-condition for manufacturing central nervous system lineages. Characterization of early changes during differentiation through positional arrangement and metabolic shifts that occur in a multi-day protocol would facilitate cell culture quality monitoring and optimization of batch culture yield. We describe an analytical framework for identifying neural rosettes from confocal microscopy within a colony of differentiating stem cells and translating co-registered, cell-resolved MALDI imaging data into interpretable readouts that are compatible with cell manufacturing needs. Rather than evaluating hundreds of ion images sequentially, the pipeline converts each region of interest into a single-cell feature matrix and summarizes whole-spectrum variation using PCA, graph-based Leiden clustering, and UMAP visualization. The resultant metabolic neighborhoods provide quantification of molecular heterogeneity within colonies and - when mapped back to x-y space - form coherent spatial domains. Together, these outputs create a practical bridge between multimodal MALDI capabilities and process-relevant interpretation: neighborhoods can be compared across conditions, ranked markers can be prioritized as putative critical quality attributes, and spatial organization can be quantified without manual, feature-by-feature screening.

13
Analysis of Confounding Factors in Reactive Cysteine Profiling Reveals Enhanced Chromatin-Protein Association via CDK7 Inhibition by THZ1

Yang, K.; Li, S.; Li, B.; Richards, D.; Dong, K.; Seneviratne, U.; Lee, W.; Iannetta, A.; Xu, H.; Gygi, S.; Yu, Q.

2026-05-07 cell biology 10.64898/2026.05.05.721470 medRxiv
Top 0.1%
22.8%
Show abstract

Recent advances in activity-based proteome profiling (ABPP) have enabled global mapping of cysteine ligandability, uncovering novel biological insights and opportunities for identifying disease vulnerabilities. While both live cell-based and native lysate-based ABPP have been applied, how cysteine ligandability differs between these systems and what factors influence these measurements remain unclear. Building on our previous development of a high-throughput TMT-ABPP workflow for native lysates, here we adapt the protocol for live cells and systematically compare cysteine ligandability across both platforms. Our analysis reveals three major contributors to the discrepancies: in-cellulo cysteine accessibility, protein abundance changes, and protein relocalization. Notably, we highlight that CDK7 inhibitor THZ1 induces substantial protein relocalization and promotes chromatin binding. Together, these results provide a practical framework for ABPP experimental design and data interpretation, supporting more accurate application of ABPP in functional proteomics and drug discovery.

14
Serum proteomics reveals distinct phenotypic signatures to IL-6 blockade between two immunotherapies

Sniezek, C.; Plubell, D.; Vlajic, K.; Hoofnagle, A.; Wu, C. C.; Buckner, J. H.; Schweppe, D. K.; Speake, C.; MacCoss, M. J.

2026-03-30 biochemistry 10.64898/2026.03.27.712461 medRxiv
Top 0.1%
22.7%
Show abstract

A recent clinical study tested the effects of two different monoclonal antibodies (mAbs) (siltuximab, anti-IL6; tocilizumab, anti-IL6R) on the fate and function of T-cells in people with type 1 diabetes. While both mAbs affect the response of T-cells to stimulation, they have very different, sometimes opposing mechanisms. Here, we use mass-spectrometry based proteomics to analyze longitudinal serum samples (baseline and two weeks post-treatment) from 20 clinical trial participants to examine the effects of siltuximab and tocilizumab on extracellular vesicles. To accomplish this, serum samples were enriched for extracellular vesicles with Mag-Net and analyzed by LC-MS/MS to identify significantly differentially abundant protein groups and pathways. Proteome analysis confirmed highly reproducible measurements across multiple draw dates. In total, we quantified >3300 protein groups of which 46 protein groups had significantly altered abundance after mAb treatment. Tocilizumab altered pathways associated with proteostasis (neddylation) and pre-notch transcription and translation. Siltuximab altered FCGR activation pathway members. In addition, quantitation of the monoclonal antibody therapies themselves enabled the measurement of the correlation between drug amounts and impacted proteins. Taken together, this work demonstrates the utility of the Mag-Net method to evaluate the impacts of therapeutic interventions on serum extracellular vesicles.

15
Reference-Based Library Construction Improves Performance in low-input diaPASEF Workflows

Charkow, J.; Ghaznavi, M.; Seale, B.; Peng, J.; Gingras, A.-C.; Rost, H.

2026-05-04 bioinformatics 10.64898/2026.04.29.721088 medRxiv
Top 0.1%
22.6%
Show abstract

In low input mass spectrometry-based proteomics, Data Independent Acquisition (DIA), including diaPASEF, is quickly becoming the method of choice for label free quantification. Whether using empirical or in silico spectral libraries, performance is dependent on the library; however, the optimal library construction strategy for low input proteomics remains an open question. To address this, we examine and develop library construction approaches that are compatible with both spectrum-centric and peptide-centric analysis workflows. These approaches leverage a closely related, high-quality sample to improve library quality. First, we validated our approach in bulk sample amounts where we observed that the effects of gas-phase fractionation based library construction is dependent on the software framework, with improvements more pronounced in OpenSWATH compared to DIA-NN. In OpenSWATH, our peptide-centric library reconstruction workflow consistently outperforms a transfer learning strategy, an emerging alternative approach. In DIA-NN, trends are dependent on library source highlighting OpenSWATHs stronger dependence on the search space. In low-input applications, such as single-cell-equivalent injection amounts (100 pg) of HeLa cell digest on a timsTOF SCP, our library construction approach provided more pronounced improvements across both software tools compared to bulk samples. Using a peptide-centric reconstruction approach with the OpenSWATH analysis framework, we detected over 15,000 peptide precursors (2480 protein groups), a 90% improvement over the original library. Furthermore, using a spectrum-centric construction approach, peptide precursor identification rates improved over 6-fold ([~]1000 to [~]6000). Our strategy provides a practical solution for generating high-quality libraries in low-input applications.

16
Mapping the interactome of human tRNA methyltransferase TRMT1 using dual proximity labeling

D'Oliviera, A.; Olson, S.; Bernhard, H.; Yu, Y.; Mugridge, J. S.

2026-05-19 biochemistry 10.64898/2026.05.18.725941 medRxiv
Top 0.1%
19.0%
Show abstract

Transfer RNA methyltransferase 1 (TRMT1) installs N2-methylguanosine and N2,N2-dimethylguanosine modifications at position 26 of mammalian tRNAs, supporting tRNA structure, translation, and cellular response to redox stress. However, the local environment and interactome of TRMT1 in the cell is poorly defined. Here, we use APEX2-based proximity labeling of the N- and C-terminus of TRMT1, coupled with label-free quantitative proteomics to map candidate TRMT1-proximal proteins in HEK293T cells. Mass spectrometry data was acquired using both data-independent acquisition (DIA) and data-dependent acquisition (DDA) methods, and it was found that DIA substantially increased proximity proteome coverage, reproducibility, and the number of significantly enriched candidate hits compared to the DDA method. N- and C-terminal APEX2-TRMT1 constructs captured largely overlapping proteomes, suggesting the dual-labeling strategy provides a robust map of proximal proteins. Analysis of the significant TRMT1-proximal proteins reveals enrichment in RNA processing and ribonucleoprotein-associated factors, in addition to hits connected to tRNA modification, tRNA biogenesis, and redox-associated biology. These data provide a proteome-scale view of TRMT1-associated cellular proteins and environments, and lay the groundwork for future validation of functional TRMT1 interaction networks. SignificanceO_LIFusing APEX2 enzyme to both N-terminal and C-terminal of the bait enhanced the sensitivity for identification of protein interactions. C_LIO_LICombining APEX2-based endogenous labeling with DIA mass spectrometry increases reproducibility and depth of proximity proteome. C_LIO_LIThe study provides a rich source of potential interacting or proximally close proteins to TRMT1, which warrants further validation studies. C_LI

17
MSstatsResponse: Semi-parametric statistical model enhances detection of drug-protein interactions in chemoproteomics experiments

Szvetecz, S.; Kohler, D.; Federspiel, J.; Field, D. S.; Jean-Beltran, P.; Seward, R. J.; Suh, H.; Xue, L.; Vitek, O.

2026-03-11 bioinformatics 10.64898/2026.03.09.710598 medRxiv
Top 0.1%
19.0%
Show abstract

Chemoproteomics is a popular approach for the identification of small molecule-protein interactions in biological systems. Several chemoproteomics workflows leverage functionalized chemical probes and mass spectrometry to measure protein engagement through direct protein enrichment or competition using a range of small molecule concentrations. Statistical methods for analysis of such dose-response chemoproteomics datasets are limited. For example, existing methods rely on fixed curve shapes and are sensitive to experimental variation, particularly when the number of doses or replicates is limited. Here, we present MSstatsResponse, a semi-parametric statistical framework for analyzing chemoproteomic dose-response experiments that uses isotonic regression that does not require a fixed curve shape. This approach improves the accuracy and robustness of curve fitting, target identification, and half-response estimation across diverse experimental designs. We evaluate MSstatsResponse by generating a benchmark chemoproteomic dataset that profiled the competition between the kinase-binding probe XO44 and the drug Dasatinib using three mass spectrometry acquisition strategies: data-independent acquisition, tandem mass tag-based data-dependent acquisition, and selected reaction monitoring. We further evaluate the method on simulated datasets that vary the number of doses, number of replicates, and levels of noise, and demonstrate that MSstatsResponse consistently improves sensitivity, specificity, and reproducibility compared to existing methods, particularly in low-replicate and low-dose settings. MSstatsResponse is implemented as an open-source R/Bioconductor package that integrates with the MSstats ecosystem for quantitative proteomics. It provides a unified workflow for preprocessing, curve fitting, target identification, and experimental design, enabling researchers to select the number of doses and replicates appropriate to their experimental goals. The software and documentation are freely available at https://bioconductor.org/packages/MSstatsResponse.

18
Classification with Missing Data - A NIFty Pipeline for Single-Cell Proteomics

Nitz, A. A.; Echarry, B.; McGee, B.; Payne, S. H.

2026-03-09 bioinformatics 10.64898/2026.03.06.710179 medRxiv
Top 0.1%
18.9%
Show abstract

Single-cell proteomics (SCP) is uniquely suited for cell-type characterization, trajectory-based inference, and microenvironment mapping. Evaluating biological hypotheses in these experiments requires labeled cells. Without a pre-measurement label, machine learning is used to identify features that characterize the cell types and classify unlabeled samples. Current implementations of annotation methods come with several statistical and computational disadvantages. First, machine-learning methods require complete data, leading to large amounts of missing-value imputation in SCP. Additionally, some machine-learning methods select features and classify samples via cross-sample comparisons, nullifying downstream cross-sample comparisons, like differential expression, through double dipping. Finally, measurements from different proteomic experiments are not directly comparable due to batch effects, significantly limiting the accuracy of classifiers trained on external data. Here we present NIFty, a top-scoring pairs based feature selection method, implemented in a full classification pipeline, that does not require pre-imputed data as input or employ circular analysis techniques, and overcomes batch effects without batch correction. When tested on imputed vs unimputed data, data with large batch effects, and multiclass data, NIFty successfully overcame the targeted classification challenges and comparably, or more accurately, classified the samples in the varied datasets.

19
Systematic characterization of the yeast secretome under diverse proteosynthetic stress conditions reveals secretion of functional ER chaperone BiP

Liu, S.; Schulz, B. L.

2026-05-22 biochemistry 10.64898/2026.05.21.727034 medRxiv
Top 0.1%
18.9%
Show abstract

The yeast secreted proteome plays critical biological roles and influences product and production parameters in industrial fermentation. Systematic profiling of the response of the yeast secretome to intrinsic and extrinsic factors is therefore essential for understanding these functions and for optimizing manufacturing processes. Here, we characterized the yeast secretome under diverse proteosynthetic stress conditions, including glycosylation deficiency, oxidative, reductive, and thermal stresses. The secretome was predominantly composed of conventionally secreted proteins, while a subset of proteins appeared to be secreted via unconventional pathways. Distinct secretome profiles were observed in response to different stressors, driven by a combination of altered intracellular proteomes, altered canonical secretion, and altered cell lysis and unconventional protein secretion, while reflecting the underlying metabolic state of the cells. Heat stress did not impact protein glycosylation but did cause similar protein misfolding stress to N-glycosylation deficiency. Intriguingly, canonically intracellular chaperone BiP was abundant in the secretome in particular stress conditions where its activity would be beneficial. BiP interacted with probable extracellular client proteins in vitro, consistent with it acting as a functional extracellular chaperone/holdase in conditions such as reductive stress in which client proteins could be misfolded outside the cell.

20
Top-down Sequencing of Intact Proteoforms using the timsOmni mass spectrometer: Accurate Determination of Co-occurring Histone Modifications

Berthias, F.; Bilgin, N.; Smyrnakis, A.; Le Boiteux, E.; Kosmopoulou, M.; Albers, C.; Suckau, D.; Mecinovic, J.; Papanastasiou, D.; Jensen, O. N.

2026-05-05 biochemistry 10.64898/2026.05.01.722147 medRxiv
Top 0.1%
18.6%
Show abstract

Deep characterization of intact proteoforms remains an analytical challenge in functional proteomics, particularly for heterogenous multi-site post-translational modifications at distinct amino acid residues. Histones are among the most dynamically and diversely post-translationally modified proteins in eukaryote cells, carrying multiple, co-occurring and reversible modifications that can give rise to isomeric proteoform species. Tandem mass spectrometry with multimodal fragmentation capabilities is a promising approach for deep characterization of intact proteoforms, such as modified histones. We applied the novel timsOmni mass spectrometer, which incorporates the Omnitrap platform enabling multimodal MS workflows, for residue-level mapping of histone modifications, including acetylation and methylation. Recombinant histones H3.1 and H4 were in vitro acetylated by enzymes GCN5, PCAF and p300 to generate mono- and multi-acetylated proteoforms. Complementary MS2 electron- and collision-based dissociation (ECD, EID, RCID and ECciD), together with MS3 strategies, produced complete or near-complete backbone fragmentation of intact protein ions (>92% amino acid sequence coverage). For monoacetylated species generated by the more site-selective lysine acetyltransferases, the dominant proteoform matched the known catalytic preferences of the enzymes (H3.1K14ac for GCN5 and PCAF, and H4K8ac for PCAF), while minor positional isomers were also identified and their relative abundance estimated. In contrast, the broader substrate specificity of p300 produced a wide distribution of H4 proteoforms bearing up to seven acetylated lysine residues. Species carrying six and seven acetylations were characterized by multimodal MS2/MS3 experiments, enabling localization of individual acetylation sites and discrimination of positional isomers. Finally, endogenous histone proteoforms from liver extracts were analyzed, yielding sequence coverages of 92-93% for the most abundant species and enabling confident localization of multiple PTMs (acetylation and methylation). These results illustrate that multimodal MSn fragmentation of intact proteins supports residue-level assignment of combinatorial histone marks and coexisting positional isomers. Graphical Abstract O_FIG O_LINKSMALLFIG WIDTH=165 HEIGHT=200 SRC="FIGDIR/small/722147v1_ufig1.gif" ALT="Figure 1"> View larger version (34K): org.highwire.dtl.DTLVardef@387ab5org.highwire.dtl.DTLVardef@2410org.highwire.dtl.DTLVardef@13fc392org.highwire.dtl.DTLVardef@140e054_HPS_FORMAT_FIGEXP M_FIG C_FIG HighlightsO_LIMultimodal MS{superscript 2}/MS3 maps histone PTMs on intact proteins. C_LIO_LIECD, EID, RCID, and ECciD provide complete or near-complete sequence coverage. C_LIO_LIMS3 localizes acetylation sites, distinguishes positional isomers. C_LIO_LIEndogenous H4 proteoforms are assigned with site-specific PTM mapping. C_LI